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(57) Abstract 

The present invention provides an improved method and apparatus for translating a source language to a target language. The 
invention uses placeables (e.g., proper nouns, titles and names, dates, times, units and measurements, numbers, formatting information, such 
as tags or escape sequences, styles, graphics, hyperlinks) to assist a translator by not having to retype information that does not need to be 
translated and to provide conversions to the target locale if necessary like for speeds. ^ 
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MACHINE-ASSISTED TRANSLATION TOOLS 
BACKGROUND OF THE INVENTION 
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1. Field of the Invention 

The present invention relates to machine processing of text and 
5 language and, more particularly, to a method and apparatus including a 
software implementation for machine-assisted translation or machine 
translation. 

2. Discussion of the Related Technology 

Translation of text from one language to another is often a tedious task 

10 requiring the efforts of a skilled translator. Soon after the advent of 
computers, researchers began to use computers as an aid for natural language 
translation. The earliest machine translation (MT) systems relied on large 
bilingual dictionaries where entries for words of the source language (SL) 
gave one or more equivalents in the target language (TL). It quickly became 

15 apparent that dictionary rules for syntax and grammar were so complex that 
experts could not develop a comprehensive set of rules to describe the 
machine translation have been abandoned. 

Throughout the world, multilingual cultures and multinational trade 
create an increasing demand for translation services. The demand for 

20 translation of commercial and technical documents represents a large and 
growing segment of the translation market. Examples of such documents are 
contracts, instruction manuals, forms, and computer software. Often when 
a product or service is "localized" for a new market, a great deal of 
documentation must be translated, creating a need for cost-effective 

25 translation. Because commercial and technical information is often detailed 
and precise, accurate translations continue to be in demand. 

Machine translation (MT) systems are usually classified as either 
direct, transfer-based, or interlingua-based. In the direct approach, there are 
no intermediate representations between the source language and the target 
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language. The source language text is processed "directly in order to 
transform it into the target text. This process is essentially a word-to-word 
translation with some adjustments. This approach is not followed by any MT 
system at present due to a perceived weakness attributable to ignoring all 
5 aspects of the internal structure of sentences. 

In the transfer-b2ised approach, information from the various stages of 
analysis from the source text is transferred to the corresponding stages of the 
generation of the target text. For example, transfer is achieved by setting up 
correspondence at the lexical level, at the grammatical level, or at the level 

10 of the structure built by the grammar, and so forth. The transfer method 
operates only on a particular pair of languages and, therefore, must be 
specifically and painsfaikingly created for each pair of languages. 

The interlingua-based approach depends upon an Eissumption that a 
suitable intermediate representation can be defined such that the source text 

15 can be mapped into the intermediate representation which can then be 
mapped into the target text. In principle, this approach is clearly attractive 
because, unlike the transfer-based approach, it is not necessary to build a 
separate transfer program for each pair of languages. However, it is not clear 
whether a truly language-independent intermediate representation can be 

20 devised. Current interlingua-based systems are much less ambitious about 
their claims to the imiversality of the intermediate representation. For a 
high-quality translation, it is often necessary to have access to some 
particular aspects of the source and target languages. 

In the transfer-ba^ed approach, there have been some recent advances. 

25 In the development of mathematical and computational models of grammar, 
there is increasing emphasis on locating syntactic as well as semantic 
information directly with the lexical items by associating structures with the 
lexical items and defining operations for composing these objects. From this 
perspective, all the information particular to a language is encapsulated in the 

30 lexical items and the structures associated with them. Different languages 
will be distinguished at this level, but not with respect to the operations for 
.composing these structures, which are the same for all languages. The idea, 
then, is to define all bilingual correspondence at this level. It remains to be 
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seen whether this approach can be carried out among a variety of different 
languages. 

Some existing MT systems require that documents be written in highly 
constrained texts. Such systems are useful for preparing manuals in 
different languages. Here, the system is really not translating a manual 
written in one natural language into a set of other natural languages, but 
rather is generating multilingual texts from a highly constrained text, thus 
avoiding many problems in conventional MT. 

Recently, research has focused on ways of using machines to assist 
human translators rather than to autonomously perform translations. This 
approach is referred to as machine-assisted human translation (MAHT). 
Systems are available that produce high-quality translation of business 
correspondence using pre-translated fragments with some translations filled 
in by human translators. An example of a machine-assisted translation tool 
is a translation memory (TM) system. Translation memory systems leave the 
creative work to the translator, however they can learn from the translator, 
and they actively support the translation process by automatically suggesting 
existing translations and terminoli^T: A t3:'anslation memory is a database 
that collects transIatioDS as they are performed, along with the soiirce 
language equivalents. After a number of translations have been performed 
and stored in .the translation memory, the translation memory can be 
accessed to assist new trsinslations where the new translations include 
identical or similar source Isinguage text as had been included in the 
translation memory. 

The advantage of such a system is that it can, in theory, leverage 
existing MT technology to make the translator more efficient without 
sacrificing the traditional accuracy provided by a human translator. The 
system makes translations more efficient by ensuring that the translator 
never has to translate the same source text twice. While a translator works, 
translation memory operates in the background to 'learn' original sentences 
and their corresponding translations. In the process, this data may be linked 
into the neural network. Later, translation memory rapidly finds identical 
or similar sentences and automatically displays them as a working basis for 

-3- 
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creating a new translation. Thus, translation memory ensures that no 
sentence need be translated twice. 

Translation memories are most useful when they are able to locate not 
only identical matches, but also approximate or "fuzzy matches." Fuzzy 
matching facilitates retrieval of text that differs slightly in word order, 
morphology, case, or spelling. The approximate matching is necessary 
because of the large variety possible in natural language texts. Fuzzy 
matching to find sentences vnth similar content has seen its performance 
perfected by the implementation of neural network technology. The 
translator has the option of choosing among alternative translations in 
addition to the one automatically suggested by memory. Along with the 
source sentence and its translation, each translation unit can also- store 
information on users, dates and frequency of use, and classifying attributes 
and text fields. This information enables easy maintenance of translation 
memories, which naturally become quite large over time. 

Concordances are another tool conmionly vised by translators. 
Electronic concordances are files having text strings, Le,, words, phrases or 
sentences, that are matched with the context in which the word appeared in 
a particiilar docummt. When a translator is unsure of the meaning to be 
given a particular word, tiie concordance can demonstrate how the word is 
VLsed in several different contexts. This information allows for a more proper 
selection of translations to accurately reflect the meaning of a sovirce 
language document. Electronic concordances include text searching software 
that allows the translator to extract all text strings in a library that include 
a desired word or phrase. The extracted texts strings can be examined 
quickly to gain a greater understanding of how a particular word or phrase 
is used in context. 

Multilingual natural language processing represents a growing need 
and opportunity in the field of international commerce and communication. 
Machine-assisted translation tools are needed to make document translation 
more efficient and less costly. Furthermore, machine-assisted translation 
tools are needed that efficiently leverage the large amount of stored 
knowledge available as pre-translated commercial and technical documents. 

.4- 
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Specifically, a need exists for a translation memory tool that is language- 
independent and provides accurate, rapid fuzzy retrieval of pre-translated 
material. 

Up until now, text that was considered to be a placeable had to be 
5 translated and manually entered by the translator. Placeables are often re- 
used "as is" in the translated text or in a converted form. Examples of such 
placeables are: proper nouns, titles and names, dates, times, units and 
measurements, numbers, formatting information, such as tags or escape 
sequences, styles, graphics, hyperlinks, cross-references, automatic fields in 

10 text, or any other kind of information that will not be translated but, rather, 
converted without knowledge about the context. The translation of placeables 
is time-consuming and can lead to errors when conversions must be made for 
things such as currency, e.g., dollar to yen and speed, e.g., miles per hour may 
to kilometers per hour. There is a need for a program that identifies the text 

15 considered to be placeable, makes any necessary conversions, and inserts the 
placeable into the target text. 

SUMMARY OF THE INVENTION 
The present invention provides an improved method and apparatus for 
translating a bouxca langoage into a target langnage* The indention uses 

rgpW^>Tn<>rt^ ryf p j^xrctiaK'jag hn th^ tgrggj: laTTgtiagg a^-ai maMrig any neces^ry 
conversions according to the target locale, e^g., "German - Standard". A 
placeable as used herein is a term that designates data that does not reqiiire 
translation into a target language or, in some cases, data types that are 

25 particularly suitable for semiautomatic replacement (e.g., proper nouns, titles 
and names, formatting information, such as tags or escape sequences, styles, 
graphics) and data requiring a translation that does not change the context 
of the data (e.g., physical and currency units, time zones, date formats, 
hyperlinks etc.). In addition, a placeable may be more complex and advanced. 

30 For example, a placeable could be determined by specialized dictionaries 
and/or the context or environment information of the entire information 
designated for translation, e,g., data in the chemical environment, automotive 
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environment, music lyrics, legal environment. The context or environment 
information would then decide how certain tenns are translated. A 
source placeable identifier maybe used to identify the placeables in the source 
information, e.g., source locale, based on source concordance relating to the 
context and environment of the placeable. In translation memories, the 
placeable may be converted into a language-independent format, e.g., meta- 
representation. The language-independent format allows the translation 
memory to convert the placeable into any target language because the format 
is common to all locales. After conversion to the independent format, the 
placeable can be automatically or semiautomatically placed in the target 
translation. A target placeable converter is used to convert the placeables 
into target information , e.g., target locale, based on target concordance 
relating to the context arid environment of the placeable. 

A system, according to the invention, may identify a placeable and 
determine its type in order to facilitate subsequent handling of the placeable, 
typically to facilitate a decision on placing, converting, or translating the 
placeable. The identification of a placeable and determination of its type may 
be accomplished by a rule-based prpcess. In addition, the identification and 
determination process may be performed by or with the assistance of a finite 
state machine sosdi as table lookup functions or a character by character 
determinatioii. 

In database-driven TMs usm^mss i nven t i on, fegg is a nigH potential 
to reduce the amount of storage space needed to store the source and target 
units in pairs by storing the units as templates, or skeletons, together with 
the placeable information. 

An. object of the invention is to reduce the effort required of a 
translator to translate source information into target information by 
eliminating the need to manually type or move the placeable to a translation 
by allowing placeables to be inserted in the target information and to perform 
any desired conversions by means of a target placeable converter. 

Another object of the invention is to reduce the amount of time of 
effort required to translate source text by automatically converting placeables, 
e.^., dates, measurement units into target text for insertion. 

-6- 
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Another object of the invention is to reduce errors that may occur 
when a translator is manually converting measurement units in a source text 



to a target text by automatically converting placeable data. 

Another object of the invention is to reduce the amount of time or 
5 effort required to translate source text by automatically identifying formatting 
codes and inserting the codes into a target text. 

Another object of the invention is to reduce the amount of time or 
effort required to translate a source text into a target text by automatically 
translating hypertext links. 
10 Another object of the invention is to convert the placeables into a 

language-independent format. 

An object of the invention is to automatically change the appearance 
of placeable elements if appropriate, for example, by converting measurement 
units, date formats, currency values and units, titles and names, etc. 
. 15 An object of the invention is to semiautomatically insert placeable 



elements at a user-defined position in the target text upon interaction from 
the user, e.g., upon one or more keystrokes, upon one or more spoken 
commands, upon mouse clicks, etc, when translating source information. 

An object of the invention is to automatically insert placeable elements. 



user interaction. 

An object of the invention is that it can Is usd with xz^Enoal 
translation or with a translation memory. 



20 



with the help of x^isrence materfcai o r otfe^r7ng?r>yn=P<WTiiHst^ 
that allow &b ™»<^l^Trm to Hii^iW'i/'TniS riSg jr^g^^am Jbt'^^^- ^:sss. 




25 



BRIEF DESCRIPTION OF THE DRAWINGS 



FIG. 1 



shows an embodiment of the invention; 



30 



FIG. 4 



FIG. 5 



FIG. 3 



FIG. 2 



shows another embodiment of the invention; 
shows another embodiment of the invention; 
shows another embodiment of the invention; 
shows another embodiment of the invention. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
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The present invention may be carried out by a variety of 
commercially available language translation computer software programs. 
The invention can work with a translation memory, however, it is not a 
requirement. Advantageously, the system will support at least two 
5 languages. 

First, the system can receive input source translation information, 
such as text data or voice data, that may be entered or retrieved in various 
ways, {e.g., data file, scanned data, voice recording, voice dictation, etc.). 
The program may divide the source translation information into linguistic 
10 forms or data translation units. This can be accomplished by segmenting 
the source translation information into words, sentences, or paragraphs. 
This process can be system-designated or user-designated. 

Manual Translation 

FIG. 1 shows a flow chart illustrating a process according to the 

15 invention when a translator is not using a translation memory. Initially, 

input source translation information may be divided into segments, such as 
sentences. Elements of the segments are provided to the processor at 
location (110). The system determines whether an element is considered a 
placeable (120). While translating the text, the system will advise the 

20 translator that a data element is a plafiedbfe ai»i -mH aBow &e translator 
to ham this placesdile \ ■ L i ^ ;-iir^-;i5TO- -(1391 ^ ^ this pomt, the 

system may also determine ti^i^a- ^c? T^^^pggae inOT^ssr » ass^t tiie 
translator. The type infbrmaxzcKi be provided to the tran^ator in any 
suitable format, such as by a leading signal, color, font change, etc. In this 

25 method, the translator may determine where the placeable should be 
inserted, e.g., upon one or more keystrokes, upon one or more spoken 
commands, upon mouse click(s), etc. If the placeable requires a conversion 
to the target information, this system may be set to automatically convert 
the placeable based on type, when the user selects to drop the placeable 

30 into the target text (140). This conversion is performed by a target 

placeable converter according to location information of the target output 
{e.g., target locale, target dictionaries based on target 
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environment/context). The manual translation may be performed in a 
Windows environment. 

Translation Memory 

Translation memories are used for reference purposes when 
5 translating information. FIG. 2 shows a commercially available software 
program for managing reference material for translation when using a 
translation memory. The reference material may be collections of text, 
normally in two or more languages, whereby previously translated source 
text units are associated with target text units. The input designated for 

10 translation will be referred to as source translation information. As 

mentioned above, translation memories are most useful when they are able 
to locate not only identical matches, but also approximate or "fuzzy 
matches." Fuzzy matching facilitates retrieval of text that differs slightly 
from the source translation information. The translation memory can 

15 provide information to the translator that indicates how close the retrieved 
suggested information matches the source translation information. This 
information could be disseminated in the form of a numerical 
representation such as a percentage. In the event that a unit of text (i.e., 
source translation information) to be translated is identical, or very 

20 similar, to a soin-ce text unit occurring in Uie translation memory and 

which has been accxnrateiy translated sat sn esaxHer time, a retrieval system 
can show as a reference tiie trai^ation stored in the translatian memory 
as target test. The tr2Zisis:6sr can ccj^ ^« reSsrence unit and modify 
it to fit the new s umitq^ teszsfefem information. In prior systems, if a 

25 placeable, occurring in tia text to be translated, was different from a 

corresponding element of the translation memory source text, it would be 
necessary for a translator to manually transfer and, if necessary, translate 
or convert placeable data into the target information. 

The user interface of the translation software provides a program 

30 window (200) for displaying different items for the translator. This 

particular example shows the translation software program window (210) 
and a word processor's program window (240) at the same time so that 

-9- 



SUBSTITUTE SHEET (RULE 26) 



wo 99/57651 



PCT/EP99/02959 



the user can view the entire source translation information, or smaller 
units of interest, during the translation process. Item (240) can also be 
configured to display the final target information, e.g., the translated text, 
during the translation process. Item (210) shows an area (220) where the 
translation program may display the linguistic form or data translation 
unit of the source translation information designated for translation. In 
addition, window (210) may display some suggested translations for the 
linguistic form when using a translation memory tool (230), 

FIG. 3 and FIG. 5 illustrate the invention when a translator is using 
a translation memory. First, the linguistic units of source translation 
information may further be divided into tokens or source elements. Once 
a source element is identified as a placeable (310), its type is determined 
(330) {i.e,, date, time, link, etc.), and the placeable may then be converted 
into a language independent format (340), such as a meta-representation, 
or directly converted into a target language or locale. The meta- 
representation (420, 520) allows the system to convert the placeable into 
any target language because the format (meta-representation) is common 
to all locales. The meta-representation can be converted according to any 
target locale to produce a target placeable (360). At this point, the 
placeable may be inserted into the target language and any conversion may 
be made automatically or semiautomatically. 

Coissider tiie foHowing ^ntssce: ''A mm caDed Mr. BOler, left his 
apartiiient on the 25^ of Jamxaxy in a car that is cs^pable of driving at 
speeds above 160 mph.'' A machine translation program would face great 
difficulties in determining whether "A man called Mr. Miller on the 
phone" or whether "A man" is the same person as "Mr. Miller." This 
question can only be answered by evaluating the context. In other words, 
if one were to look at the word "called" alone, it would be virtually 
impossible to translate the sentence correctly. However, if the entire token 
(410) is considered: "A man, called Mr. Miller," then you (or a machine) 
could come up with a meaningful translation. FIG. 3 shows how the 
invention processes this sentence and how a placeable would be treated the 
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first time it was identified. Three placeables are identified: Mr. Miller, 
25''' of January, and 160 mph. 



5 



Identified Placeable 


Classification & 
language-independent 
formatting 


Transformation 
to be inserted in the 
target text 


Mr, Miller 


Placeable/Name w/Title 


Herr Miller 


25'"* of January 


Placeable/Date-longform 


25. Januar 


160 mph 


Placeable /number w/unit 


260 km/h 



When the system is presented with the foregoing text as part of an 
input of text to be translated {e.g., source translation information), the 
system will first segment the text, preferably into sentences. The 

10 foregoing sentence may then advantageously be tokenized. According to 
one embodiment of the invention, it can be tokenized into elements 
corresponding to the words or phrases in the source translation sentence. 
The tokenizing process will consider whether the elements may be 
identified as placeables according^ to a rafeis^^ fpsay astdfar v^lt2i tie xssr 

15 of fizxite state tools snxdi as k»3k-^ tz^^^ S^fc tbs of «*ansi± 
4i^«tijff^ as a xsia as toeszniBHi. Tiss d^enmnatmzi may tUso 

be accdznp]isii«i by a raie-baad inqpizy wihsr witfa the use of a finite 
state process such as by tables. The output of the tokenizer will include 
the non-placeable elements and an indication of the type of any placeable 

20 elements. This output may be provided to a translation memory in order 
to locate any identical or similar segments that have been previously 
translated. The system may propose a translation based on the previously 
translated target text located in the translation memory and direct 
placement with or without conversion -of the placeable elements. The 

25 system may advantageously be implemented by software in a general 
purpose computer such as a personal computer. 
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The determination of a placeable may be a one or two step method 
using a rule-based system/a finite state system. The one-step method may 
also be accomplished by determining the type using a rule-based system 
that views the entire token. For example: is this token a date, is this 
5 token a proper novm, is this token a hyperlink? The two step-method may 
first determine whether the token is a placeable and then determine its 
type. This may be accomplished using a finite state process that examines 
each character of a token one at a time until a determination is reached. 
One of the identifying features of a placeable is that its meaning is 

10 not likely to vary by context. Such placeables may include types that may 
be directly used in a translation, for example, numbers or graphics. Other 
t3^es of placeables may be used after conversion, such as numbers coupled 
with units. For example, a placeable appearing as "62 miles per hour" may 
be converted to "100 kilometers per hour". Such a conversion is 

15 distinguished from a translation by its formulaic nature. A formulaic 

conversion is suitable in situations where context is not likely to affect the 
translation. 

The following example illustrates the translation of text including a 
similar placeable in another source text. The translation memory system 
20 includes the following sotirce text unit plus its translation from the 
reference file(s): 

TraiisiatlDn Memcary &>UTce Text Unit: ''A msn, called Mr. SmiUi, Irft ids 
apartment on the 1st of April in a car that is capable of driving at speeds 
above 100 mph.'* 

^ Translation Memory Target Text Unit (German): ''Ein Mann, namens 
Herr Smith, verliess sein Apartment am 1. April in einem Auto, das 
schneller als 160 km/h fahren kann." 

New Text Unit to be translated is: "A man, called Mr. Miller, left his 
apartment on the 25*^ of January in a car that is capable of driving at 
30 speeds above 160 mph." 

As shown in FIG. 5., the new source text unit may be divided into 
elements (510). The placeables may then be classified and converted to a 
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language-independent format (520). When the software is able to correlate 
three placeables in the Translation Memory Target Text Unit with 
placeables in both the Translation Memory Source Text Unit and the New 
Text to be translated, a translation memory system using this invention 
can determine and propose the location for inserting the placeables 
automatically, that is, without any user interaction. In the above example, 
the system will be capable of determining that: 

I. The only difference between the Text Unit (to be translated) 
and the Translation Memory Source Text Unit (a similar text 
that has been translated earlier) is found in three tokens. 

II. All three tokens are placeables. 

III. The type (i.e., date, speed, name) of the placeable tokens are 
the same in Translation Memory Source Text Units and the 
text to be translated. 

IV. In the Did Target Text Unit, exactly the same number and 
type of placeable tokens can be found. The translation 
system may propose to reuse the previous translation 
(Translation Memory Target Text Unit) and replace the 
placeable tokens with the new name (=Mr. Miller), the new 
date (=25^ January), and the new speed (=160 mph). 

V. Th^ s of tg yar e irsa^ r? Tfr^--»^f^'^^-^^-« parts tis piaasgMg 

Storing Placeables 

There are two types of translation memory systems on the market 
today: reference file driven TMs and database driven TMs. Reference file 
driven TMs keep the source text and target text in two different locations, 
aligning the two by keeping a list of reference pointers to each other. In 
reference file driven systems, all source text units ever written (or 
otherwise created), and all translations thereof, may be physically stored 
and kept in files. In database driven TMs using this invention, there is a 
high potential to conserve data storage space by simply storing the text 
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We claim: 

1. A method for processing source information comprising the steps of: 
parsing input source information into elements; 

identifying placeable elements by a predetermined criteria; 
designating said placeable elements by type. 

2. A method for processing source information according to claim 1, 
further comprising the step of : 

determining a source locale. 

3. A method for processing source information according to claim 2, 
further comprising the step of : 

applying a source placeable identifier to determine said type of said 
element. 

4. A method for processing sorrree tnfor matkat HrtXH cilag to ci^ivn 1^ 
further comprising the step of : 

determining a target locale. 

5. A method for processing source information according to claim 1, 
further comprising the step of : 

applying a target placeable converter to convert said type of 
said element. 
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1 6. A method for processing source information according to claim 2, 

2 further comprising the step of : 

3 applying said source locale to determine said element by type. 

1 7. A method for processing source information according to claim 1, 

2 further comprising the step of : 

3 converting said element into a language-independent format. 

1 8. A method for processing source information according to claim 1, 

2 further comprising the steps of : 

3 determining whether said placeable is a proper noun; 

4 placing said placeable directly into a target output. 

1 9. A method for processing source information according to claim 1, 

2 fxirther comprising the steps of : 

3 determiziing iTtdiether said placeable is a date; 

^ A 1 n^t. uA^i gfffK l date izzto a laigeL tj*rMfM'iii.tM?» ^czssr&sig to a 

5 ta2^e£ locale information. 

1 10, A method for processing source information according to claim 1, 

2 further comprising the steps of : 

3 determining whether said placeable is a proper noun; 

4 converting said placeable into a language independent format. 
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1 11. A method for processing source information according to claim 1, 

2 further comprising the steps of : 

3 determining whether said placeable is a proper noim; 

4 converting said placeable into a meta-representation. 

1 12. A method for processing source information according to claim 1, 

2 further comprising the steps of : 

3 determining whether said placeable is a date; 

4 converting said placeable into a language independent format, 

1 13, A method for processing source information according to claim 1, 

2 further comprising the step of : 

3 determining whether said placeable requires conversion. 

1 14. A method for processing source information according to claim 1, 

2 £xr&e3r r^ T n p rjsi'ng the step of i 

3 4etenxLmiixg whrtlser said placeable is a pro^ r noun. 

1 15. A method for processing source information according to claim 1, 

2 ' further comprising the step of : 

3 determining whether said placeable is a date, 

1 16. A method for processing source information according to claim 1, 

2 further comprising the step of : 

3 determining output requirement for conversions. 
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17. A computer driven language processing system for processing source 
information comprising: 

a parser; 

an element identifier, connected to an output of said parser; 

a tgcpe designator, connected to an output of said element identifier. 

18. A computer driven language processing system for processing source 
information comprising: 

a parser for parsing source information into elements; 
an element identifier identifying placeable elements by a 
predetermined criteria; 

a t3^e designator for designating said placeable elements by type. 

19. A method for processing source information comprising the steps of: 
segmeiiting input source informaiicm into elements; 

MwaLifymgr placeable elements hy a psredetenziined cri!>*na; 

20. A method for processing soiu-ce information according to claim 8, 
further comprising the step of : 

comparing said placeable elements to a data set. 

21. A method for processing source information according to claim .8, 
further comprising the step of : 

comparing said placeable elements by type to a data set. 
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